(54) Abstract Title

Portal to allow access to web pages using different formats 

(57) A content-aggregating portal is provided for aggregating content from multiple content providers that
each make available content in XML format. As well as aggregating content as appropriate to satisfy
user requests, the portal uses XML style sheets to apply transformations to the XML-based content to effect
adaptations such as user device adaptation (e.g. WAP, PDA, desktop), user preferences adaptation, or
location-based adaptation. The content providers provide their content according to a common XML semantic
so as to enable the portal to provide consistent user responses when aggregating from several content
providers. The common XML semantic may be specified by the portal operator or informed by related actions
as specified in a services registry. In a simpler embodiment, a content-indexing portal is provided.

Distributed Content Indexing and Content Aggregating Portals 



Field of the Invention 

The present invention relates to distributed content indexing and content aggregating
portals.

Background of the Invention

Today, many mobile services have to be implemented by taking information initially
created for presentation-focused .html documents and converting it to .wml ("Wireless
Markup Language", part of the WAP - Wireless Application Protocol - specification). This
approach results in the content providers making available two different forms of each
document, one for fixed users (html documents) and the other for mobile users (wml
documents) - see Figure 1 of the accompanying drawings. While utilities exist to automate
this conversion, it is not possible to fully automate a guaranteed successful solution. Thus,
for example, Everypath has a clever solution where humans are used to identify what parts
of web pages need to be conveyed to a mobile user. These are marked and then the system
performs the conversion in real-time. This can work well until the original web site is
re-designed. Other, more automated solutions typically disappoint. The result is that good
presentation in .wml comes from good original .wml encoding, done specifically for
mobile, and therefore a dual (fixed and mobile) content eco-system exists. Here, the
original information provider is adversely impacted by the diversity.

NTT DoCoMo is currently running a very successful mobile e-services business in Japan
based on "i-mode", which can be characterized as a combination of technological
and business strategies that together give mobile users a very good information-access
experience. I-mode uses a simplified form of html for content known as "c-html". One of
the important business components has been to assemble a very large amount of valuable
content, adapted for presentation in a tightly specified environment, to form the core of an
offering. The original, critical mass, core pulled in thousands of other content providers to
create a single very viable ecosystem. While c-html could be deployed in other markets, it
would not have the same impact simply because the challenge to build a critical mass
would be huge. As already indicated, these markets already suffer from multiple content
types with web, WAP and sometimes SMS forms. Content providers find this situation
unacceptable and will resist all suggestions of another presentation-focused content
encoding, especially with the eventual practical requirement to migrate to XHTML.
XHTML, sometimes called the future of the Internet, obeys the grammar rules of XML and
conforms to the vocabulary of HTML. Because it is presentation focused, it is expected that
XHTML will be widely adopted for content that is presented on PC browser applications. As
such, automated content conversions from material destined for big screens to small screens
will be easier. However, it does not address the fundamental concern over how to automate
content presentation so that it has equal impact on a large PC screen and on the small screen
of a mobile phone. This is recognised in the creation of basic XHTML with a restricted set of
capabilities for small screen devices.

In fact, XML can be used as a potential single source of .html and .wml documents.
Content providers can automatically create the various encodings from a single source
using XML style sheets. These XML style sheets can be used to select items from a
document and convert the document to another form. For example, they can be used to put
device-specific formatting in the document as well as to filter a document to provide only
location-specific information.

The logic of style sheets can be represented as follows:

XSL[XML] -> XML
that is, a style sheet used on an XML document can produce another XML document.

This notation can be used to show conversion style sheets:

XSLhtml[XML] -> .html
that is, a style sheet used on an XML document can produce an html document.

Similarly:

XSLwml[XML] -> .wml
that is, a style sheet used on an XML document can produce a wml document.

Hence to perform localisation and device type characterization, the following operation
can be performed:

XSLwml[XSLdevice_type[XSLposition[XML]]] -> .wml
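
The nested notation above amounts to function composition: the innermost style sheet is
applied first and its output is fed to the next. The following is a minimal sketch of that
idea only. Plain Python functions stand in for XSL style sheets (the Python standard library
has no XSLT processor), and the element names `item`, `region` and `text`, the sample
content and the 20-character limit are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical source document; the vocabulary is invented for this sketch.
SOURCE = """<content>
  <item region="bristol"><text>Harbour festival opens today</text></item>
  <item region="london"><text>Tube delays on the Central line</text></item>
</content>"""

def xsl_position(doc, region):
    """Stand-in for XSLposition: keep only items for the user's region."""
    out = ET.Element("content")
    for item in doc.findall("item"):
        if item.get("region") == region:
            out.append(item)
    return out

def xsl_device_type(doc, max_chars):
    """Stand-in for XSLdevice_type: truncate text to suit a small screen."""
    out = ET.Element("content")
    for item in doc.findall("item"):
        new_item = ET.SubElement(out, "item", dict(item.attrib))
        ET.SubElement(new_item, "text").text = item.findtext("text", "")[:max_chars]
    return out

def xsl_wml(doc):
    """Stand-in for XSLwml: render the remaining items as one WML card."""
    wml = ET.Element("wml")
    card = ET.SubElement(wml, "card", {"id": "main"})
    for item in doc.findall("item"):
        ET.SubElement(card, "p").text = item.findtext("text", "")
    return ET.tostring(wml, encoding="unicode")

def render_wml(xml_text, region, max_chars=20):
    """XSLwml[XSLdevice_type[XSLposition[XML]]] -> .wml"""
    doc = ET.fromstring(xml_text)
    return xsl_wml(xsl_device_type(xsl_position(doc, region), max_chars))
```

Calling `render_wml(SOURCE, "bristol")` applies the three stand-ins in the order given by the
notation: position filtering, then device adaptation, then WML encoding.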

For a content provider who has the sources of information in XML, and who wants to
produce good Web and WAP services, this is a good starting point. One encoding and then
multiple style sheets produce all the content type encodings. They could even add a c-html
style sheet. While there are some shortcomings with a pure XML story, good control
of all aspects of information creation as well as style sheet creation will get round them.
Specifically, there are problems in the area of semantic meaning beyond that possible with
locally defined DTDs (Document Type Definitions). Furthermore, the content provider
takes responsibility for directly providing the content, notably the presentation, to the
mobile, and therefore the role of a portal/intermediary, like an i-mode service, is simply
that of an index.

A content provider with just html has a number of migration routes to provide mobile
content. The content provider could provide parallel .html and .wml as described above.
Alternatively, the content provider could go directly to a core XML original and use
style sheets to create the .html and .wml (see Figure 2).

It is an object of the present invention to provide improved ways of providing content to
users through portals.

Summary of the Invention 

According to one aspect of the present invention, there is provided a method of providing
content to users via a communication infrastructure, wherein a plurality of content
providers each make available content in XML format, and a portal operator provides a
content-indexing portal through which users can access content from the content providers,
the portal using XML style sheets to apply at least one of the following transformations to
XML-based content retrieved from a content provider in response to a user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the content providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses in respect of
content from said plurality of content providers.

According to another aspect of the present invention, there is provided a method of
providing content to users via a communication infrastructure, wherein a plurality of
content providers each make available content in XML format, and a portal operator
provides a content-aggregating portal through which users can access content from the
content providers with the portal serving to aggregate content from multiple content
providers as appropriate to provide a response to a user request, the portal using XML style
sheets to apply at least one of the following transformations to XML-based content retrieved
from a content provider in response to a user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the content providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses when
aggregating from multiple said content providers.

According to a further aspect of the present invention, there is provided a method of
providing services to users via a communication infrastructure, wherein a plurality of
service providers each make available content in XML format associated with services, and
a portal operator provides a portal through which users can access content from the
service providers, the portal using XML style sheets to apply at least one of the following
transformations to XML-based content retrieved from a content provider in response to a
user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the service providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses in respect of
services from said plurality of service providers.

Brief Description of the Drawings

Embodiments of the invention will now be described, by way of non-limiting example,
with reference to the accompanying diagrammatic drawings, in which:
• Figure 1 is a diagram of a prior art approach to content production for web and
mobile users;
• Figure 2 is a diagram of an alternative prior art approach to content production for web
and mobile users;
• Figure 3 is a diagram illustrating the provision of content in a standard form XML;
• Figure 4 is a diagram of a content-indexing portal embodying the invention;
• Figure 5 is a diagram showing XML data flows for a content-aggregating portal
embodying the invention;
• Figure 6 is a functional block diagram of the portal subject of Figure 5;
• Figure 7 is a diagram of the Hewlett-Packard Mobile E-Services Architecture;
• Figure 8 is a diagram of a services portal embodying the invention;
• Figure 9 is a diagram illustrating a service access operation of the Figure 8 services
portal.

Best Mode of Carrying Out the Invention 

Establishing Content Eco-System Control

To provide a coherent content eco-system, existing XML users and html core users would
be encouraged to make their content available as XML as depicted in Figure 3. The
operator (or possibly another party) would ensure the meaning conveyed in the XML is
consistent and would define a single standard for the vocabulary, for example by providing
or specifying a small set of Document Type Definitions (DTDs). For simple services, at
start-up, this could be a few pages in length. Existing XML users would require an
additional style sheet to convert the XML documents, specifically to assign the right tag
names. Core html users would need to be encouraged to export their content in XML.
While they could invent their own XML structures, the operator standard would provide a
very attractive starting point. Few other companies could take this role.
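
The additional style sheet that assigns "the right tag names" is essentially a renaming
pass from a provider's own vocabulary to the operator's. A minimal sketch of such a pass
follows, written as a Python function rather than an XSL style sheet. The tag names in
`TAG_MAP` are hypothetical and are not taken from any operator standard.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from one provider's tag names to the operator vocabulary.
TAG_MAP = {"story": "article", "head": "headline", "body": "text"}

def to_operator_vocabulary(element, tag_map=TAG_MAP):
    """Return a copy of element with tag names mapped to the agreed vocabulary.

    Tags that are not in the mapping are copied unchanged.
    """
    renamed = ET.Element(tag_map.get(element.tag, element.tag), dict(element.attrib))
    renamed.text = element.text
    renamed.tail = element.tail
    for child in element:
        renamed.append(to_operator_vocabulary(child, tag_map))
    return renamed
```

Only the names change; structure, attributes and text are carried over, which is why an
existing XML user needs no more than this one extra conversion step.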

The purpose of XML in this arrangement is to give a structured document that can easily
be searched and later on accessed for content aggregation. There are advantages to the
operator taking the responsibility for the meaning in the vocabulary and this could, at least
in part, be done by referencing existing and upcoming XML-based standards that cover
different aspects of content syndication, including NewsML and NITF (www.iptc.org)
and PRISM (www.prismstandard.org). Similarly, there are standards dealing with the
transfer and management of syndicated content, like the Information and Content Exchange
protocol (www.icestandard.org).

The benefit of the proposed arrangement is that search engines can deliver more consistent
and hopefully accurate results, improving the user experience.

For delivery to a mobile, this proposed arrangement is not technically constrained to the
specifics of content encoding for i-mode or WAP. Equally, the following conversions are
possible:

XSLwml[XSLdevice_type[XSLposition[XML]]] -> .wml

and

XSLc-html[XSLdevice_type[XSLposition[XML]]] -> .html (of type c-html)

This gives the foundation for a solution where the content providers export XML and the
mobile operator's servers perform the final style-sheet manipulations, such as device type
manipulations. This is advantageous as only the operator need hold the entire set of
device-specific style sheets, not the content provider. This creates a simple but effective
control point, through the ownership of 'meaning' and device formatting, to impose business
rules to ensure a good user experience. Furthermore, by making the conversion style sheets
available to content providers, they can check how their content will be converted and
presented.
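
One way to picture this control point is as a table, held only by the operator, that maps
each device type to its conversion style sheet. The sketch below is illustrative only:
Python functions stand in for the style sheets, and the device names `wap` and `i-mode`,
the `headline` and `text` tags and the `preview` helper are assumptions rather than part of
the described system.

```python
import xml.etree.ElementTree as ET

def to_wml(doc):
    """Stand-in for a WML conversion style sheet."""
    wml = ET.Element("wml")
    card = ET.SubElement(wml, "card")
    ET.SubElement(card, "p").text = doc.findtext("headline", "")
    return ET.tostring(wml, encoding="unicode")

def to_chtml(doc):
    """Stand-in for a c-html conversion style sheet."""
    html = ET.Element("html")
    body = ET.SubElement(html, "body")
    ET.SubElement(body, "h1").text = doc.findtext("headline", "")
    ET.SubElement(body, "p").text = doc.findtext("text", "")
    return ET.tostring(html, encoding="unicode")

# Only the operator holds this table; providers export XML and nothing else.
DEVICE_STYLE_SHEETS = {"wap": to_wml, "i-mode": to_chtml}

def deliver(xml_text, device_type):
    """Apply the operator-held style sheet for the requesting device type."""
    try:
        style_sheet = DEVICE_STYLE_SHEETS[device_type]
    except KeyError:
        raise ValueError(f"unsupported device type: {device_type}") from None
    return style_sheet(ET.fromstring(xml_text))

def preview(xml_text):
    """Let a content provider check how one document renders on every device."""
    return {device: deliver(xml_text, device) for device in DEVICE_STYLE_SHEETS}
```

Supporting a new device then means adding one entry to the operator's table, with no change
on the content provider's side.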

Content Indexing Portals

The foregoing approach enables the provision of content-indexing portals giving improved
end-user experience.

A foundation for a simple content indexing portal lies in the business rules for content
providers and the content translations executed in the portal. Figure 4 illustrates an
embodiment of a content-indexing portal working on these principles. The portal's basic
service is a personalised index for each mobile user that adapts with usage and specifically
points to content the user has indicated they are interested in.
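
As an illustration of an index that "adapts with usage", the sketch below keeps a per-user
list of declared interests and reorders it by how often each entry is followed. The class
name, the method names and the ranking rule are assumptions; the text does not specify how
the adaptation is performed.

```python
from collections import Counter

class PersonalIndex:
    """Per-user index whose ordering adapts to what the user actually opens."""

    def __init__(self, interests):
        # Entries the user has explicitly said they are interested in.
        self.interests = list(interests)
        self.usage = Counter()

    def record_visit(self, entry):
        """Note that the user followed an index entry, adding it if it is new."""
        if entry not in self.interests:
            self.interests.append(entry)
        self.usage[entry] += 1

    def entries(self):
        """Most-used entries first; ties keep the order in which they were added."""
        return sorted(self.interests, key=lambda entry: -self.usage[entry])
```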

The portal also controls content presentation. A clear knowledge of the XML style sheet
rules permits an XML content creator to be sure how their content will appear, yet they
do not need to create or buy all the potential XML translation style sheets.

With XML-based content and the final mobile content encoding in the portal, migration to
WAP-NG is integrated in the design.

To ensure a good user experience, the content indexing portal relies on rules that state:

• The limited set of screen sizes and colours that are supported - device profiles.

• How content is obtained from content providers (protocol) - content providers will be
required to export content in the agreed XML formats using http or https, where the
XML documents appear simply as another MIME type.

• How the content providers provide the content in a consistent manner - there will be
one standard meaning for the XML vocabulary for all content; to achieve this, the operator
(or possibly another party) will define, in simple terms, the meaning for all content
types.

The portal/server, owned by the network operator, will then provide basic index page
services exactly as in i-mode or simple WAP portals and the content will be collected in
XML. This will be transformed in the portal/server to create device-specific content using
XML style sheets (and other tools).

The portal/server may reference and forward content that is already formatted for WAP or
c-html presentation. This will be transparently passed through the portal as required.

Based on XML, with the agreed operator vocabulary meanings, the content indexing portal
personalises and renders personal service menus. Localisation could easily be added.
These web assets are then delivered by references to the services themselves.

Content and Service Aggregating Portals 
A content-indexing portal is in many ways only a partial solution to user needs; what is
often really called for is an information and service aggregation portal where information
is pulled together for a user rather than simply referenced.

Figure 5 depicts, in simplified form, a content-aggregating portal in terms of the XML
document flows for a c-html user-device environment. XML documents, in the agreed
vocabulary, are collected by the portal. The content is then manipulated in various ways:
aggregated, personalised, localised and converted. The diagram shows XSL style sheets
used for location and device type conversions. (The diagram only shows the XML paths; it
does not show the paths required to relay original WAP and i-mode information without
any manipulation.) Figure 6 is a functional block diagram of the content-aggregating
portal.
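
The collection and aggregation step works because every provider uses the same vocabulary,
so one piece of code can read all of them. A minimal sketch follows; the `article` and
`headline` tags, the `topic` attribute and the `page` wrapper are assumed names chosen for
the example, and personalisation is reduced to a simple topic filter.

```python
import xml.etree.ElementTree as ET

def aggregate(documents, topics):
    """Merge <article> elements from several providers, keeping chosen topics.

    documents maps a provider name to that provider's XML text.
    topics is the set of topics the user has asked for.
    """
    page = ET.Element("page")
    for provider, xml_text in documents.items():
        for article in ET.fromstring(xml_text).iter("article"):
            if article.get("topic") in topics:
                entry = ET.SubElement(page, "article", {"source": provider})
                ET.SubElement(entry, "headline").text = article.findtext("headline", "")
    return page
```

The resulting `page` element is itself XML in one vocabulary, so it can be passed on to the
location and device type conversions in the same way as a single provider's document.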

The detailed implementation of the portal will depend on requirements for performance.
One issue is whether the content is always to be collected fresh from the content provider
when required or whether the content is to be cached and the cache refreshed from time to
time. Equally, it must be decided whether aggregation is to be done on demand or in
advance. In a preferred embodiment, a two-level content cache is provided, holding both
raw content and pre-aggregated content.
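
A two-level cache of this kind might be sketched as follows. This is an illustration under
stated assumptions, not the preferred embodiment itself: the class and parameter names, the
time-based refresh policy and the 300-second default are all invented for the example.

```python
import time

class TwoLevelCache:
    """Level 1 holds raw provider content; level 2 holds pre-aggregated pages."""

    def __init__(self, fetch, aggregate, max_age=300.0, clock=time.monotonic):
        self.fetch = fetch            # callable: provider name -> raw XML text
        self.aggregate = aggregate    # callable: list of raw XML texts -> page
        self.max_age = max_age        # seconds before an entry is refreshed
        self.clock = clock            # injectable so that expiry can be tested
        self.raw = {}                 # provider -> (timestamp, raw XML)
        self.pages = {}               # tuple of providers -> (timestamp, page)

    def _fresh(self, entry):
        return entry is not None and self.clock() - entry[0] < self.max_age

    def get_raw(self, provider):
        """Return raw content, fetching from the provider only when stale."""
        entry = self.raw.get(provider)
        if not self._fresh(entry):
            entry = (self.clock(), self.fetch(provider))
            self.raw[provider] = entry
        return entry[1]

    def get_page(self, providers):
        """Return an aggregated page, rebuilding it only when stale."""
        key = tuple(providers)
        entry = self.pages.get(key)
        if not self._fresh(entry):
            page = self.aggregate([self.get_raw(p) for p in key])
            entry = (self.clock(), page)
            self.pages[key] = entry
        return entry[1]
```

Calling `get_page` ahead of any user request corresponds to aggregating in advance; calling
it when a request arrives corresponds to aggregating on demand with the raw level still
saving repeated fetches.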

Whilst the content-aggregating portal could retrieve content based on pre-established
indexes, it is preferred to provide a search engine to ensure that the most appropriate
content is obtained. This search engine works in two ways:

• specifically at the request of the user;

• speculatively, in searching for the best content for typical requests and caching the
content.

A filtering mode can also be implemented where new content is continuously evaluated
against individual user profiles. The filtering mode can, for example, be supported by the
C3ulead "Current Awareness" solution and allows users to be proactively notified about
new interesting content via, for example, SMS. In Figure 5, this could happen inside the
"XML collector and aggregator" block.

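The filtering mode described above can be pictured as a loop that tests each new item
against every user profile and notifies the users who match. The sketch below assumes
keyword-based profiles and a caller-supplied `notify` callable standing in for an SMS
gateway; neither is taken from the product named in the text.

```python
def matching_users(item_keywords, profiles):
    """Return users whose profile shares at least one keyword with the item."""
    keywords = {k.lower() for k in item_keywords}
    return sorted(user for user, interests in profiles.items()
                  if keywords & {i.lower() for i in interests})

def filter_new_content(items, profiles, notify):
    """Evaluate each new item against every profile and notify the matches.

    items is an iterable of dicts with "headline" and "keywords" entries.
    notify is any callable taking (user, headline), for example an SMS sender.
    """
    for item in items:
        for user in matching_users(item["keywords"], profiles):
            notify(user, item["headline"])
```
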
In a content aggregating portal, the personalisation, location and content aggregation
systems directly interact. With a consistent representation of content, management and
aggregation are possible using standard ISV products including Interwoven, BroadVision and
ATG Dynamo.

The Figure 5 content-aggregating portal provides the following benefits: 

• It integrates together valuable content in a manner to ensure a good user experience. 

• It does not impose technical restrictions on hybrid WAP and i-mode solutions. 

• It contains the foundations for extension into deeper content aggregation.

The content aggregating portal can be deployed using the Hewlett-Packard Mobile E-Services
Architecture and specifically, the HP Bluestone J2EE and XML environment as
well as Total-e-Mobile. The HP Mobile E-Services Architecture is depicted pictorially in
Figure 7.

A Generic Approach with Migration into E-Services 

The above-described content aggregating portal is a distributed XML document integration
bus that uses standard web protocols but which has the operator effectively assuming the
role of a standards body; while this is satisfactory in a single national market, it is less
likely to work in a multinational environment.

Equally, the focus of the above has been on presenting content rather than delivering useful
services. In a mobile world, there will be a greater need for an activity, that is, a need to
actively respond to "Change the time of my flight" by doing just that rather than showing a
list of alternative flight times; also, changing the flight time is likely to have knock-on
effects requiring action to change car hire booking, late arrival at the hotel, etc.

To resolve these issues a more generic portal architecture is envisioned. In this case, rather
than define a single set of meanings for the XML-based content, a service registry is set up
that contains many service offers, each using a published vocabulary with well understood
meaning and purpose. The solution is now universally deployable without requiring the
operator to become a standards authority; rather, the operator becomes the holder of the
critical services repository. Furthermore, the solution is based on delivering e-services, i.e.
executing activities, rather than reading web pages.

To achieve this, each service offer must be wrapped in a well-defined manner. Each
e-service performs a task and can be registered, discovered and used. This is the underlying
logic of HP's e-speak architecture. Indeed, these e-services can be contents, activities,
operator-services in the portal, like billing and location, or any other asset that can be
placed on the Web.
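
The register, discover and use cycle can be sketched generically as below. This is not the
HP e-speak API, which is not described in this text; the class, its method names and the
idea of tagging each offer with a single vocabulary string are assumptions made to keep the
example small.

```python
class ServiceRegistry:
    """Holds service offers, each described by a published vocabulary."""

    def __init__(self):
        self._offers = {}

    def register(self, name, vocabulary, handler):
        """Record an offer: its name, the vocabulary it uses, and a callable."""
        self._offers[name] = {"vocabulary": vocabulary, "handler": handler}

    def discover(self, vocabulary):
        """Return the names of all offers that use the given vocabulary."""
        return sorted(name for name, offer in self._offers.items()
                      if offer["vocabulary"] == vocabulary)

    def invoke(self, name, **request):
        """Use a service: execute the named offer with the request parameters."""
        return self._offers[name]["handler"](**request)
```

Because an offer is an executable handler rather than a document, invoking it performs the
activity itself, such as changing a flight, instead of returning a page to read.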

The generic form of the architecture could utilise HP e-speak, creating a mobile e-service
integration framework (see Figure 8). This can be deployed with the HP Bluestone
Total-e-Mobile to create a whole solution, which can be hosted in the HP Mobile e-Services
Architecture.

A service integration framework can be based on HP e-speak, where all Web-based assets
are treated as e-services. Each is described in the registry and a composite e-service is
created from the individual services. The portal, with service aggregation, would be
constructed as a set of services, each delivering exactly the same logical blocks as before
but, rather than simply pulling XML documents from remote sites, the remote sites would
offer their assets as an e-service. They too would wrap their assets in the well-defined
manner, specified via HP e-speak, and their content appears as a service, for example a
description of the weather that can be animated rather than a textual description.

The simple example shown in Figure 9 shows a step-by-step operation of the service. It
assumes that each service has already been registered in the Registry.

In this example, a user requests their home page from the portal (1). The portal calls on the
index generator (2), which collects personalisation (3) and location (4) information before
going to the Registry to collect a list of services (5) for the home page (6). When a service
is requested (7), a service decorator is used to pull together the request (7.1.1, 7.1.2), which
is forwarded with additional useful parameters (7.1.3). Once the service responds, then
event logging (7.1.4) and billing (7.1.5) can be carried out before the final delivery (7.2).
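
The numbered steps can be followed in a short sketch in which every server is passed in as
a plain callable. The function names and signatures are hypothetical. In particular,
mapping steps 7.1.1 and 7.1.2 to the personalisation and location servers is an assumption
based on the stated re-use of those servers; Figure 9 is not reproduced here.

```python
def home_page(user, personalisation, location, registry):
    """Steps 1 to 6: build the list of services for a user's home page."""
    preferences = personalisation(user)          # step 3
    position = location(user)                    # step 4
    offered = registry(preferences, position)    # step 5
    return {"user": user, "services": offered}   # step 6

def request_service(user, service, personalisation, location, log, bill):
    """Steps 7 to 7.2: decorate the request, call the service, log and bill."""
    request = {
        "user": user,
        "preferences": personalisation(user),    # step 7.1.1 (assumed)
        "position": location(user),              # step 7.1.2 (assumed)
    }
    response = service(request)                  # step 7.1.3
    log(user, response)                          # step 7.1.4
    bill(user)                                   # step 7.1.5
    return response                              # step 7.2
```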

The example shows different types of resource that are wrapped as e-services and
illustrates re-use of the personalisation and location servers. These are operator-provided
services that are used in conjunction with third party services to deliver the whole user
experience. Clearly a new service can be rapidly created by re-use of the e-services that are
known by the Registry.

In the foregoing, a three-phase architecture has been described progressing from content
indexing portals, through content aggregating portals, into portals that support
activity-based e-services. The content semantics problem can be resolved in two ways: by
eco-system control over the meaning of XML content, or by use of a Registry where the
meaning of the content is clarified by the action. It will be appreciated that many variants
are possible to the above described embodiments of the invention.

CLAIMS



1. A method of providing content to users via a communication infrastructure, wherein a
plurality of content providers each make available content in XML format, and a portal
operator provides a content-indexing portal through which users can access content from
the content providers, the portal using XML style sheets to apply at least one of the
following transformations to XML-based content retrieved from a content provider in
response to a user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the content providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses in respect of
content from said plurality of content providers.

2. A method according to claim 1, wherein the semantic meaning of the XML is specified
by the portal operator.

3. A method of providing content to users via a communication infrastructure, wherein a
plurality of content providers each make available content in XML format, and a portal
operator provides a content-aggregating portal through which users can access content
from the content providers with the portal serving to aggregate content from multiple
content providers as appropriate to provide a response to a user request, the portal using
XML style sheets to apply at least one of the following transformations to XML-based
content retrieved from a content provider in response to a user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the content providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses when
aggregating from multiple said content providers.

4. A method according to claim 3, wherein the agreed semantic meaning of the XML is
specified by the portal operator.

5. A method according to claim 3, wherein the portal includes a two-level cache for
caching both raw content obtained from the content providers and for caching content
aggregated by the portal.

6. A method according to claim 3 or claim 5, wherein the portal aggregates and caches
content in advance of any related request.

7. A method according to claim 6, wherein the content aggregation is effected in
accordance with predetermined user content-personalisation profiles.

8. A method of providing services to users via a communication infrastructure, wherein a
plurality of service providers each make available content in XML format associated with
services, and a portal operator provides a portal through which users can access content
from the service providers, the portal using XML style sheets to apply at least one of the
following transformations to XML-based content retrieved from a content provider in
response to a user request:
- user device adaptation of the content;
- user preferences adaptation;
- location-based adaptation;
the service providers providing their content according to an agreed semantic meaning for
the XML whereby to enable the portal to provide consistent user responses in respect of
services from said plurality of service providers.

9. A method according to claim 8, wherein the common XML semantic is specified in a
registry run by the portal operator in association with actions associated with the related
services.

10. A method according to claim 8, wherein the portal includes a two-level cache for
caching both raw content obtained from the service providers and for caching content
aggregated by the portal.

11. A method according to claim 8 or claim 10, wherein the portal aggregates and caches
content in advance of any related request.

12. A method according to claim 8, wherein the content aggregation is effected in 
accordance with predetermined user service-personalisation profiles. 
